Massively Parallel Cartesian Discrete Ordinates Method for Neutron Transport Simulation. (SN cartésien massivement parallèle pour la simulation neutronique)
نویسنده
چکیده
High-fidelity nuclear reactor core simulations require a precise knowledge of the neutron flux inside the reactor core. This flux is modeled by the linear Boltzmann equation also called neutron transport equation. In this thesis, we focus on solving this equation using the discrete ordinates method (SN) on Cartesian mesh. This method involves a source iteration scheme including a sweep over the spatial mesh and gathering the vast majority of computations in the SN method. Due to the large amount of computations performed in the resolution of the Boltzmann equation, numerous research works were focused on the optimization of the time to solution by developing parallel algorithms for solving the transport equation. However, these algorithms were designed by considering a super-computer as a collection of independent cores, and therefore do not explicitly take into account the memory hierarchy and multi-level parallelism available inside modern super-computers. Therefore, we first proposed a strategy for designing an efficient parallel implementation of the sweep operation on modern architectures by combining the use of the SIMD paradigm thanks to C++ generic programming techniques and an emerging task-based runtime system: PaRSEC. We demonstrated the need for such an approach using theoretical performance models predicting optimal partitionings. Then we studied the challenge of converging the source iterations scheme in highly diffusive media such as the PWR cores. We have implemented and studied the convergence of a new acceleration scheme (PDSA) that naturally suits our Hybrid parallel implementation. The combination of all these techniques have enabled us to develop a massively parallel version of the SN Domino solver. It is capable of tackling the challenges posed by the neutron transport simulations and compares favorably with state-of-the-art solvers such as Denovo. The performance of the PaRSEC implementation of the sweep operation reaches 6.1 Tflop/s on 768 cores corresponding to 33.9% of the theoretical peak performance of this set of computational resources. For a typical 26-group PWR calculations involving 1.02× 1012 DoFs, the time to solution required by the Domino solver is 46 min using 1536 cores.
منابع مشابه
Simulation massivement parallèle des écoulements turbulents à faible nombre de Mach. (Massively parallel simulation of low-Mach number turbulent flows)
HAL is a multi-disciplinary open access archive for the deposit and dissemination of scientific research documents, whether they are published or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers. L’archive ouverte pluridisciplinaire HAL, est destinée au dépôt et à la diffusion de documents scientifiques de niveau...
متن کاملDEM-based simulation of concrete structures on GPU
The benefit of using the Discrete Element Method (DEM) for simulations of fracture in heterogeneous media has been widely highlighted. However modeling large structure leads to prohibitive computations times. We propose to take advantage of Graphics Processor Units (GPUs) to reduce the computation time, taking advantage of the highly data parallel nature of DEM computations. GPUs are massively ...
متن کاملAdaptive Parallel Wavelet Method for the Neutron Transport Equations
The numerical solution of the time-dependent neutron transport problem in 2-D Cartesian geometry is considered. The problem is described by a coupled system of hyperbolic partial differential equations with the parameters of multiple groups. The system, for given group parameter, is discretized by a discrete ordinate method (SN) in angular direction and adaptive spline wavelet method with alter...
متن کاملAccelerating the Sweep3D for a Graphic Processor Unit
As a powerful and flexible processor, the Graphic Processing Unit (GPU) can offer a great faculty in solving many high-performance computing applications. Sweep3D, which simulates a single group time-independent discrete ordinates (Sn) neutron transport deterministically on 3D Cartesian geometry space, represents the key part of a real ASCI application. The wavefront process for parallel comput...
متن کاملDomain decomposition with discrete element simulations using shared-memory parallel computing for railways applications
Numerical simulation with discrete elements leads to several issues for large scale problems and long loading times, as for the granular dynamic simulations of the ballasted railway behavior. To reduce computational costs, we study the use of two strategies: domain decomposition methods and shared-memory parallelization with OpenMP. An example of a maintenance process, the tamping, on a portion...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015